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Abstract 

A shuffle of two strings is formed by interleaving the characters into 
a new string, keeping the characters of each string in order. A string 
is a square if it is a shuffle of two identical strings. There is a known 
polynomial time dynamic programming algorithm to determine if a 
given string z is the shuffle of two given strings x, y; however, it has 
been an open question whether there is a polynomial time algorithm 
to determine if a given string z is a square. We resolve this by proving 
that this problem is NP-complete via a many-one reduction from 3- 
Partition. 



1 Introduction 

If u, v, and w are strings over an alphabet S, then w is a shuffle of u 
and v provided there are (possibly empty) strings Xj and y, L such that u = 
x\x 2 ■ ■ ■ Xk and v = y\y 2 ■ ■ ■ yu and w = x\y\x 2 y2 ■ ■ ■ XkVk- A shuffle is 
sometimes instead called a "merge" or an "interleaving". The intuition for 
the definition is that w can be obtained from u and v by an operation similar 
to shuffling two decks of cards. We use w = u v to denote that w is a 
shuffle of u and v\ note, however, that in spite of the notation there can 
be many different shuffles w of u and v. The string w is called a square 
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provided it is equal to a shuffle of a string u with itself, namely provided 
w = u u for some string u. This paper proves that the set of squares is 
NP-complete; this is true even for (sufficiently large) finite alphabets. 

The initial work on shuffles arose out of abstract formal languages, and 
shuffles were motivated later by applications to modeling sequential exe- 
cution of concurrent processes. To the best of our knowledge, the shuffle 
operation was first used in formal languages by Ginsburg and Spanier [?]. 
Early research with applications to concurrent processes can be found in 
Riddle [?, ?] and Shaw [?]. Subsequently, a number of authors, including 
[?, ?, ?, ?, ?, ?, ?, ?, ?, ?] have studied various aspects of the complexity 
of the shuffle and iterated shuffle operations in conjunction with regular ex- 
pression operations and other constructions from the theory of programming 
languages. 

In the early 1980's, Mansfield [?, ?] and Warmuth and Haussler [?] stud- 
ied the computational complexity of the shuffle operator on its own. The 
paper [?] gave a polynomial time dynamic programming algorithm for decid- 
ing the following shuffle problem: Given inputs u, v, w, can w be expressed 
as a shuffle of u and v, that is, does w = u v? In [?], this was extended 
to give polynomial time algorithms for deciding whether a string w can be 
written as the shuffle of k strings u\ , . . . , Uk , so that w = u\ U2 • • • Uk , 
for a constant integer k. The paper [?] further proved that if k is allowed to 
vary, then the problem becomes NP-complete (via a reduction from EXACT 
Cover with 3-Sets). Warmuth and Haussler [?] gave an independent 
proof of this last result and went on to give a rather striking improvement 
by showing that this problem remains NP-complete even if the k strings 
ui, . . . , Uk are equal. That is to say, the question of, given strings u and w, 
whether w is equal to an iterated shuffle uQuQ ••• Qu of u is NP-complete. 
Their proof used a reduction from 3-Partition. 

The second author [?] has recently proved that the problem of whether 
w = u v is in AC 1 , but not in AC . Recall that AC (resp., AC 1 ) is 
the class of problems recognizable with constant-depth (resp., logarithmic 
depth) Boolean circuits. 

As mentioned above, a string w is defined to be a square if it can be 
written w = u u for some u. Erickson [?] in 2010, asked on the Stack Ex- 
change discussion board about the computational complexity of recognizing 
squares, and in particular whether this is polynomial time decidable. This 
problem was repeated as an open question in [?]. An online reply to [?] 
by Per Austrin showed that the problem of recognizing squares is polyno- 
mial time decidable provided that each alphabet symbol occurs at most four 
times in w (by a reduction from 2- Sat); however, the general question has 
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remained open. The present paper resolves this by proving that the prob- 
lem of recognizing squares is NP-complete, even over a sufficiently large fixed 
alphabet. 

The NP-completeness proof uses a many-one reduction from the strongly 
NP-complete problem 3- Partition (see [?]). 3-Partition is defined as 
follows: The input is a sequence of natural numbers S = {rii : 1 < % < 3m) 
such that B = (Yli=i n i)/ m is an integer and B/A < rii < B/2 for each 

1 £ [3m]. The question is: can S be partitioned into m disjoint subsequences 
Si, ... , S m such that each Sk has exactly three elements with the sum of 
the three members of equal to Bl Since 3-Partition is strongly NP- 
complete, it remains NP-complete even if the integers rii are presented in 
unary notation. 

2 Mathematical preliminaries 

Let w be a string of symbols over the alphabet £ with w = w\ ■ ■ ■ w n for 
Wi 6 S, so n = \w\. A string u is a subword of w if w = V\UV2 for some strings 
vi,V2- A string u' is a subsequence of w if u> = u' Qv for some string u. Both 
the subword u and the subsequence u' contain symbols selected in increasing 
order from w; the symbols of u must appear consecutively in w but this is 
not required for u' . The exponential notation u l , for i > 0, indicates the 
word obtained by concatenating i copies of u. If u%, . . . , Uf. are strings, the 
product notation Y\e=i u e indicates the concatenation u\U2 ■ ■ ■ u^-iUk- 

Now suppose that w is a square. Figure Q] gives an example of how a 
square shuffle w = u u gives rise to a bipartite graph G on the symbols 
of w. The graph G is defined based on a particular computation of w as a 
shuffle uQu as obtained by shuffling two copies of u0 The vertices of G are 
the symbols w\, . . . , w n of w, and, for each i, G contains an edge joining the 
symbol of w corresponding to the i-th symbol of one copy of u to the symbol 
of w corresponding to the i-th symbol of the other copy of u. W.l.o.g., if G 
contains an edge joining uij and Wk with j < k, then Wj corresponds to a 
symbol in the first copy of u, and corresponds to a symbol in the second 
copy of u. This can be done without loss of generality, possibly by changing 
the order in which the symbols of the it's are shuffled out to form w. (So we 
could instead define G as a directed graph if we wished.) 

The bipartite graph G has a special "non-nesting" property: if G con- 
tains an edge from to W£ and an edge from w p to w q , then it is not the 

1 In general, there may be several such ways to express it) as a square shuffle, even for 
the same u. 
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C\ X X X C2 C\ X X X C2 C\ X X C2 C\ X X C2 C\ X C2 C\ X C2 



Figure 1: Let w be the string {cix^ 1 C2) 2 {c\x 2 C2) 2 {c\XC2) 2 ■ This figure shows 
the bipartite graph G associated with the square shuffle w = u u with u 
equal to c\xxxc2XC2C\xc\xc2 ■ It is not pictured, but we also have w = vQv 

With V = C\X^ C2C\X 2 C2C\XC2- 

The string w can be expressed in product notation as rifc=o( c i a;3 ~ fcc 2) 2 - 




a b a b abba 



Figure 2: Examples of two crossing (and hence non- nested) edges for a graph 
on abab, and two nested edges for a graph on abba. Nested edges cannot 
appear in a graph obtained from a shuffle. 

case that k < p < q < £. This is because there are indices i and i' such that 
Wk and wg correspond to the i-th symbols of the first and second copies of u, 
and such that w p and w q correspond to the z'-th symbols of the two copies 
of u. (Compare to Figure 0) But then k < p implies i < i' whereas q < £ 
implies that i' < i, and this is a contradiction. 

In fact, as is easy to prove, if there is a complete bipartite graph G 
of degree one (i.e., a perfect matching) on the symbols of w which is non- 
nesting, then w can be expressed as a square shuffle w = u u so that G is 
the bipartite graph associated with this shuffle. 

The non-nesting property for G can also be viewed as an "anti-Monge" 
condition, namely as the opposite of the Monge condition. A bipartite graph 
on the symbols of the string w is said to satisfy the Monge condition pro- 
vided that, instead of having the non-nesting condition, it is prohibited that 
k < p < £ < q. In other words, the Monge condition allows nested edges 
but prohibits crossing edges. The Monge condition has been widely studied 
for matching problems and transportation problems. Many problems that 
satisfy the Monge condition or the "quasi-convex" condition are known to 
have efficient polynomial time algorithms; for these see [?] and the refer- 
ences cited therein. There are fewer algorithms known for problems that 
satisfy the anti-Monge property, and some special cases are known to be 
NP-hard [?]. This is another reason why we find the NP-completeness of 
the square problem to be interesting: it provides a hardness result for anti- 
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Monge matching in a very simple and abstract situation. 

The set of squares w is accepted by the following finite-state queue au- 
tomaton. A queue automaton is defined similarly to a PDA but with a 
queue instead of a stack. As usual, the automaton reads the input w from 
left to right. The automaton's queue is initially empty and supports the 
operations push-right (enqueue) and pop-left (dequeue). The automaton 
accepts if its queue is empty after the last symbol of w has been read. The 
non-deterministic algorithm for the automaton is as follows: 

Repeatedly do one of the following: 

a. Read the next input symbol a and push it onto the queue, or 

b. If the next input symbol a is the same as the symbol at the top of 
the queue, read past the input symbol a and pop a from the queue. 

When either step a. or b. is performed, we say that the input symbol a 
has been consumed. In case b., we say that the symbol a on the queue has 
been matched by the input symbol. Note that a. is always allowed, and b. 
only when the symbols match. 

A configuration of the automaton is a "snapshot" of the computation, 
and consists of the queue contents Q and the remaining part x of the input to 
be read. A configuration is denoted Q\\x. A single step from configuration C 
to configuration C is denoted C \~ C. A sequence of zero or more steps 
is denoted C\-*C . The condition C h C can hold in one of two ways: if 
C is Q||cra;, then either (a) C is Qcr||x, or (b) C is Q'\\x where Q = oQ' . 
The input w is accepted if e\\w r-*e||e, where e is the empty string. More 
generally, a configuration C is accepted provided Ch*e||£. 

If a computation proceeds as 

uiU2m\\xix 2 x 3 h* U2U3Z1WX2X3 h* u 3 ziz 2 \\x 3 , (1) 

then we say that the subword X2 of the input is consumed by the subword U2 
of the queue. This means that the symbols of X2 are either matched against 
symbols from 112, or are pushed onto the queue only after all the symbols 
of u\ have been popped and before any symbol of U3 is popped. In addition, 
no symbol of x\ or X3 is matched against a symbol from u%. The word Z2 
which is pushed onto the stack while X2 is consumed by U2 is called the 
resultant)?] The following two simple lemmas, which will be used in the next 
section, illustrate these concepts. 



2 Note that in (JXJ) also x\ is consumed by Ui with resultant z\. 
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Lemma 1. If X2 is consumed by U2 yielding the resultant z%, then U2 and Z2 
are subsequences of X2- Furthermore, X2 = U2 Z2- 

Proof. This holds since U2 is equal to the subsequence of symbols of X2 that 
are matched against symbols of U2, and Z2 is the subsequence of symbols 
of X2 which are enqueued and so not matched against symbols from U2- □ 

Lemma 2. Suppose eo,e are symbols that do not appear in the strings Ui, 
Xi, or v. Consider the string w = e^u\eu2e ■ ■ ■ eukeeoX\ex2e ■ ■ ■ ex^ev. Any 
accepting computation of w must proceed as: 

e\\w h* u\eu2eu^,e ■ ■ ■ euke\\x\ex2ex^e ■ ■ ■ extev (2) 

h* U2eu%e ■ ■ ■ eu^ez\ \\x2ex3e ■ ■ ■ ex^ev 

h* u-^e ■■■ eukez\Z2 \\x%e ■■■ ex^ev 

h* Ukezi ■ ■ ■ Zk-i Wx^ev h* z\ • • • Zk\\v l~*e||e, 

so that each X{ is consumed by the corresponding m with resultant zi. 

Proof. The two occurrences of eo must be matched with each other during 
the accepting computation. By the non-nesting property, this means that 
all the symbols between the two eo's must be pushed onto the queue instead 
of matching any prior symbol. At this point, there are exactly k many e's 
on the queue and an equal number of e's remaining in the input. The non- 
nesting property thus implies that the i-th occurrence of e pushed onto the 
queue must be matched against the i-th occurrence of e in the second half 
of w. From this it is evident, again by the non-nesting property, that the 
accepting computation follows the pattern ([2]) ; therefore each xi is consumed 
by □ 



3 Main Result 



Theorem 3. The set Square of squares is NP-complete. This is true even 
for sufficiently large finite alphabets. 

We shall prove the theorem for an alphabet with 9 symbols. A relatively 
straightforward modification of our proof shows that the theorem also holds 
for alphabets of size 7. We conjecture that Theorem [3] holds even for al- 
phabets of size 2, but this would require substantially new proof techniques. 
(Over a unary alphabet, Square is just the set of even length strings.) 

The rest of the paper is devoted to the proof of Theorem [3j Clearly 
the set of squares is in NP. To prove the NP-completeness, we shall give a 
logspace computable many-one reduction from 3-Partition to Square. 
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Consider an instance of 3- Partition S = (raj : 1 < i < 3m) such that 
the rij's are given in unary notation and such that B = (Ylf=i n i)/ m 18 an 
integer. We also have B/4 < i%i < B/2 for each i, but shall not use this fact. 
Without loss of generality, the values rii are given in non-increasing order 
(if not, then reorder them). The many-one reduction to Square constructs 
a string ws over the alphabet 

£ = {ai,a,2,b,eo,e,ci,C2,x,y}, 

such that ws is a square iff S is a "yes" instance of 3-Partition. The 
string ws consists of three parts: 

ws '■= (loader^) (distributors) (verifiers). 

These are defined by 



(loaders) = e fJ(6 2B e) 

i=i 

m 

(distributors) = eo JJ((oi6 B a2) 3 e) 



i=i 

3m 



(verifiers) = ]^[ [v4k-3D k v 4k _ 3 v 4k _ 2 DkV4k-2V4k-iEkV4k-iV4kFkV4k\ 
where 



fc=i 



V£ = c±x y c 2 



D k = {a\b nk a 2 ^ m - k+1 



2) 

E k = (a 2 1 b B al) 3m - k (a 1 b nk a 2 )(a 2 1 b B al) 3ll, - L 
F k = {a\b B alf^ m ^ 



It is useful to let Ue := afb a 2 as this lets us shorten the expressions for 
D k , E k , and F k , so D k = U 3 ™~ k+ \ E k = U 3 B m ~ k ai b n *a 2 U 3 B m - k , and F k = 

u B 

The length of ws is quadratic in m + Yli n ii so w s i s polynomially 
bounded. It is clear that ws can be constructed from S by a logspace 
computation. 

The actions of the loader and distributor are relatively easy to under- 
stand, so we describe them first. As the next lemma states, the intended 
function of the loader is to place m many blocks of 2B many 6's, separated 
by e's, onto the queue. 
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Lemma 4. Any accepting computation for wg starts off as 

e\\w s \~* e (6 2jB e) m ||(distributor s )(verifier s ). 

In the subsequent part of the accepting computation, the i-th occurrence of 
the subword (ai&^a^) 3 in (distributor 5) will be consumed by the i-th occur- 
rence of b 2B in the queue. 

Proof. This is an immediate consequence of Lemma [2] since there are only 
two occurrences of eo in wg, and since e has the same number of occurrences 
between the two eo's as after the second eo- □ 

Consider how the subword (aib B a2) 3 can be consumed by b 2B . Since 
there are no a\s or o^'s in b 2B , the a\s and a^'s must be pushed onto the 
queue. In addition, exactly 2B of the 3B many occurrences of b in {a\b B a2)^ 
must be matched against the symbols of b 2B . Thus, when the subword 
(a\b B a2) 3 is consumed by b 2B a resultant string of the form ai&^^ai^o^ai&^a^ 
must be pushed onto the queue where j\ +J2 +33 = 3B — 2B = B. Since the 
automaton is non-deterministic, any such values for j\ , J2 , j'3 can be achieved. 
These observations, together with Lemma HI prove Lemma [5) 

Lemma 5. Given any sequence of non-negative integers (ik)k^i such that 

Vj G {1,2,... ,m}, i Z j-2 + %-i + kj = B, (3) 

there exists a computation e\\ws l~* rifc=i( a i^ fea 2)ll (verifier g). Conversely, 
if e\\ws \~*W || (verifier g) then W must be of the form rifc=i( a i^ fca 2)j so that 
condition (0j holds. 

We now turn to analyzing the effect of (verifier g). By LemmaEJ any ac- 
cepting computation for e\\wg reaches a configuration ]^[|!!! 1 (ai6 2fe a2) || (verifier^) 
satisfying ((3|). The intuition is that the sets Sj := {«3j-2, izj-i, hj} should 
be a solution to the 3-Partition problem S. By ([3]), the members of 
each Sj sum to B. Thus, the sets Sj are a solution to the 3-Partition iff 
the sequence (ik)k=i is a permutation (a reordering) of S = (^fe)fc=i- 

By Lemma El to complete the proof of Theorem it suffices to show 
that n^™ 1 (ai& l '=a2)||(verifier,s') is accepted if and only if the sequence (ifc)f=i 
is a permutation of S. We first prove the easier direction of this equivalence: 

Lemma 6. Suppose (ife)?^ is a permutation of S. Then the configuration 
n^ 1 (ai6 Jfc a2)||(verifier5) is accepted. Therefore, if S is a "Yes" instance of 
3-Partition, then e \\wg^r*e\\e and wg is in Square. 
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We prove Lemma [6] after first proving Lemmas [9] and [TUJ 

Definition 7. A computation accepting ws satisfies the V- Condition pro- 
vided that for each t (for 1 < t < 12m) the second occurrence of the subword 
V£ in ws is consumed by the first occurrence of V£ in ws- This means that 
the symbols of the second V£ are completely matched by those of the first v^. 

Theorem Q3] below will prove that the V-Condition must hold, but for 
now it suffices to just assume it. 

Definition 8. A string z has k alternations of the symbols ai,a 2 provided 
(aia 2 ) fc is a subsequence of z but (aia 2 ) k+1 is not. 

Lemma 9. Let i±, . . . ,i3 m -k+l be natural numbers, and W = Yl^=i k+1 (aib l ^a 2 ). 
Suppose the V- Condition holds for a computation containing the subcompu- 
tation 

W\\v 4k _ 3 D k v Ak „ 3 v 4k ^ 2 D k v 4k _ 2 (- ■■){-* W'\\(- ■ • ). (4) 

(The "(■■■)" denotes the rest of the input string.) Then W = W, and 
ij < n k for all j. Conversely, if each ij < n k , then the subcomputation (Jl]) 
can be carried out. 

Since W' = W, the computation (j4|) might seem to achieve nothing, and 
thus be pointless; the point, however, is that it ensures that the values ij 
are < n k . This will be useful for the proof of Lemma fTTl 

Proof. By the V-Condition, and the non-nesting property, the computa- 
tion Q must have the form 

W\\v 4k _ 3 D k V4 k _ 3 V 4k _ 2 DkV4k-2(- • • ) h * W"\\v 4k ^ 2 DkV4k-2(- • • ) h * W'lK" ■ ■ )) 

where W" is the resultant when the first D k is consumed by W, and W 1 is 
similarly the resultant when the second D k is consumed by W" . 

W and D k both have 3m — k + 1 alternations of a\, a 2 . Therefore, when 
D k is consumed by W, the j-th ai (resp., a 2 ) symbol in W must match an a± 
(resp., a 2 ) from the j-th block a\a\ (resp, 0202) in D k . The other a\ (resp., 
a 2 ) in that block is pushed onto the queue as part of W". Furthermore, the 
subword b 1 ? in the j-th component of W must match ij of the 6's in the j-th 
occurrence of b nk in D k \ this leaves n k — ij many 6's to be pushed onto the 
queue as part of W". This is possible if and only if ij < n k for all j, and if 

so, w = nf=r* +1 M nfc -^ 2 ). 

The second D k must be consumed by W" , and the same argument shows 
that this means W = n?=r fc+1 ( a i^ a 2) = W. □ 
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Lemma 10. Let i\, . . . , i^m^k+i be natural numbers, and W = H|=i 
Suppose ij = maxjjij} = n^. Lei i^, . . . ,i' 3m _ k be the sequence with ij 
omitted, and let W' = Y\^=i k \a\b % i 02) ■ Then there is a computation 

WWvAk-xEkV^xVAkFkViki- ■■)!-* W\\(- ■ ■ ). (5) 

The computation ([5]) will satisfy the V-Condition. Lemma [12] below 
will prove a converse to Lemma [TOl under the additional assumption of the 
V-Condition. Lemma [TUI however, is all that is needed for Lemma [6) 

Proof. We construct a computation of the form 

W\\v4k-iE k V4k-iV4kF k V4k(- ■•)!-* W"\\v 4k F k VAk{- ■ • ) • • ). (6) 

Recalling that E k = C/| m ~ A } a 1 b Hk a 2 U^ n ~ k and using ij = n k , the first half 
of the computation ([6]) has the form 

3m— fc+1 

[J («i^«2)lb4fc-i^ m - fe ai6 nfc a2^1 m - fc ^-i 

3m-fc+l J-1 

h* n (ai6 i % 2 )^ fc -in(°i 6B ^ a 2)ii^ -fe " (j-i) «i^ fc «2^i m -wi 

3m-fc+l J-1 

h* J] (ai^a 2 )« 4 fc-l n( a i 6B ^ a 2)^^ (J_1) ||aife nfc a2^1 m -^4 fc -i 
j=j j=i 

3m-fc+l J-1 

h* J] (ai6^a 2 )«4fc-i \\{a 1 b B -^a 2 )uf l - k - {J - 1) \\U^- k v ik -i 
j=j+i j=i 

3m-fc+l J-1 

j=j+i j=i 

J-1 3m-fc+l 

Btt-in^^^)^* n («i& s ^«2)n«4 fc -i 
i=i i=j+i 

J-1 3m-fc+l 

j=i j=j+i 

J— 1 3m— fc 

= l[(aibP- i ia 2 )U% n - k n (ax^aaJHe = W"||e. 
3=1 j=J 
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The first and fifth steps shown above use the fact that when a\b B a\ is 
consumed by a\b^a2 the resultant is a\ b B - ~ lj d2 as shown in the proof of 
Lemma[9j The third step matches a\b li a 2 with the equal a\b nh a 2 - The final 
step matches v^-i- The other steps push words v^-i and Ub from the 
input to the queue. 

The second half of the computation ([6]) proceeds as follows: 

J— 1 3m— k 

X\{ ai b B -^a 2 )U B m - k f[ (a.b^a^Ma^alf^v^ 

3=1 3=J 

/c 3 1 

3=3 3=1 
3m— k J— I 

h* (aifr^oa)^ n(°i &i5 °2) II {a\b B alf m - k ^ J -^v ih 

3=J 3=1 

3m— k 3m— k 

^* v 4k Yl (oi^aa)||«4fc ^* II (oi^02)||e- 

3=1 3=1 

This is easily seen to be a correct computation. This proves Lemma [101 □ 

We can now prove Lemma [H Suppose that S = (nj)j™i and that 
is a permutation of (Wj)|2i witnessing that S 1 is a "Yes" instance of 3- 

Partition. Let W k be the string \^i k+1 {aib 1 ^ a 2 ) where i^, ... ,i' 3m _ k+1 
is the sequence obtained by removing k — 1 of the largest elements of the 
sequence (ij)^™-^. (When there are multiple equal values ij, they can be 
removed from the sequence in arbitrary fixed order, say according to the 
order they appear in the sequence). The n k s are non- increasing, so the 
maximum i'j is equal to n k - Therefore, Lemmas l9l and [TOl imply that 

W k \\v4 k -3DkV4k-3V4k-2DkV4k-2V4k-lEkV4k-lVAkFkV4k \~* Wfc+l||e. 

Combining these computations for 1 < k < 3m gives (verifier s) h*e||e. 
Lemma[5]gives eHiygr-*^!! (verifiers). Thus e\\ws h*e||e. This proves Lemma[6l 
□ 

The next lemma gives the converse of Lemma [6j under the assumption 
that the V-Condition holds. This, together with Theorem [14] stating that 
the V-Condition must hold, will prove Theorem [3j 
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Lemma 11. Let S be an instance of 3-Partition and (ifc)|=i sa ^ s fy the 
conditions of Lemma\^and W = nfc=i( a i^ a 2)- Suppose that W\\ (verifiers) |-*e||e 
with a computation that satisfies the V-Condition, so e ||it? 1 sfl-*e||e and ws is 
in Square. Then S is a "Yes" instance of 3- Partition. 

The main new tool needed for proving LemmafTTlis a converse of LemmallOl 

Lemma 12. Let 1 < k < 3m, let i\, . . . ,i^ m ^k+i be natural numbers, and 
W k = Y\^i k+1 {aib^a 2 ). Suppose that maxjjij} < rifc. Further suppose 
there is a computation 

WkWv^iEkV^^v^FkV^i- ■■) h* W fc+ i||(- • • ) (7) 

that satisfies the V- Condition. Then there is a J such that ij = maxjjij} = 
nfc such that, letting i[, . . . ,i' 3m _k be the sequence (ij)j with ij omitted, we 

have W k+ i = f\f={ k '{axb^ a 2 ) ■ 

Before we prove Lemma [T2"| we indicate how it, and the V-Condition 
assumption, imply Lemma [IT] and thus imply Theorem [3l Suppose C is a 
computation e\\ws h*e||e that obeys the V-Condition. For 1 < k < 3m + 1, 
define the strings Vk to be such that C contains the configurations 

3m 

Vk II J\ [vAl^DlV^^VAt^DlVil^VAl-lEllVu^iVuFiVu] . 

e=k 

Of course, these V^'s are the intermediate queue contents as (verifiers) is 
consumed. For 1 < k < 3m, define VI to be the strings such that C contains 
the configuration 

Vk II v Ak-\EkVik-l v AkFk v Ak 
3m 

j [ [vM~3D£VM-zVU-2De,VM-2VAl-lElVM-lVuFzVu] . 
l=k+l 

Claim 13. We have: 

(a) V\ is equal to Y\^™i k+1 (aib' lj 02) for some sequence satisfying ([3]). 

(b) For 1 < k < 3m + 1, Vk equals Y\f=i k+1 (o>lb ij 02) for some sequence 

{i'j)j which is obtained from by removing (instances of) the 

k — 1 largest entries of (nj)^™-^. 

(c) For 1 < k < 3m, V k ' equals Vk, and its maximum i'j value is less than or 

equal to n k . 
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The claim is proved by induction on k. Part (a), and the equivalent 
k = 1 case of (b), follows from Lemma Part (c) for a given k follows 
from Lemma [9] and from the induction hypothesis that (b) holds for the 
same value of k. Part (b) for k > 1 follows from Lemma [12] and from the 
induction hypothesis that (b) and (c) hold for k — 1. Since V^ m+ i = e, 
part (b) implies that the sequence is a reordering of And, 

since ([3]) holds, witnesses that S is a "Yes" instance of 3- Partition. 

This completes the proof of Lemma [Tl] and thereby Theorem [3] modulo the 
proofs of Lemma [12] and Theorem [TU □ 

Proof, (of Lemma [T2l) Consider a particular computation C as in (|7|) that 
satisfies the V-Condition. C has the form 

^fclKfc-i^fc^fc-i^fcFfc^- ••)!-* Z\\v 4k F k v 4k h* Wfe + i||(- • • ) 

where Z is the resultant of E k being subsumed by W k . By assumption, 
W k has 3m — k + 1 alternations of ai,a2, whereas has 2(3m — /c) + 1 
and Ffc has 2 (3m — k). The string E k is a concatenation of "blocks" of the 
form a\b nk a2 or the form Ub = a\b B a\. Each subword a\b lj a2 in W has its 
symbol a\ matched by some a\ in E k and its 02 matched by some 02 in the 
same block or a later block of E k : these symbols a\ and 02 in E k determine 
a contiguous sequence of blocks in E k which is consumed by 016^02- We 
call these blocks the "j-consumed" portion of E k , and denote it Yj. The 
resultant of aib lj CL2 and its j-consumed portion is denoted Zj. There may 
also be blocks of E k which are not part of any j-consumed portion, and 
these are called "non-matched" blocks of E k . The string Z is then the 
concatenation of the words Z^ for 1 < j < 3m — k + 1, interspersed with 
the non-matched blocks of E k . 

Let us consider the possible resultants Zj. We can write E k as E k = 
P 1 P 2 P 3 where P 1 = P 3 = U^ l ~ K and P 2 = a\b nh ai. There are several cases 
to consider. 

Case a. Yj is (aj^aj)' for some I > 1, and thus is a subword of either 
Pi or P3 in E k . When Yj is consumed by 016^02, one of the two 
initial ai's, any i,- of the 6's, and then one of the two final 02 's are 
matched; the remaining symbols of Yj become the resultant Zj and 
are pushed onto the queue. Therefore, Zj is equal to 

i 

Zj = ai b B - mi J\{ala\b B - ms )a 2 (8) 

s=2 
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where m-i + m<i + • • • + me = ij. Note that Yj and Zj both have I 
alternations of a±,a2- 

Case b. Yj spans from Pi to P 3 and equals {a\b B ''a^Y 1 a\b nk a 2 {a{b E ''affi 2 
where £1,^2 > 1- Arguing as in the previous case, Zj is equal to 

ai t>B-rni^ a 2 a 2 b B-m^ aib n k -m h+la2 TJ ^^B-m^ 
s=2 s=£ 1 +2 

where mi + mi + • • • + m^ 1+ ^ 2+ i = ij. In this case, Yj and Zj both 
have i\ + £2 alternations of di , 02- 

Case c. Yj is a\b nk ai, namely, Yj = P 2 . In this case, Zj is equal to just 
5™fe-*j . if jj . = refc; then Zj is just e: this is called a "full cancellation" 
case. Note that Zj has zero alternations of 01,02, whereas Yj has one 
alternation. 

Case d. Yj is (af b B a^) e a\b nk a2. We now have 

Zj = a l b B - mi Y[(ala 2 1 b B - m °)ala l b nk - m *+K (9) 

where mi + • • • + m£ + i = ij. Zj consists of a part with £ alternations 
of 01,02 followed by a subsequent a\ (and possibly b's). In the "full 
cancellation" case, m^ + i = n^, and since ij < n&, we have = 
m s +i = ij and, for s < I, m s = 0. Otherwise, Zj ends with one or 
more Vs. 

Case e. The case where Yj is a\b nk ai{a\b B 'a%) is completely analogous to 
case d., and we omit it. 

For simplicity, let's assume for the moment that neither case d. nor e. occurs. 
This means that there is at most one occurrence of either case b. or c, and 
the rest of the cases are case a. In cases a. and b., Zj has the same number of 
alternations of a\, a% as Yj. Of course the number of alternations in the non- 
matched blocks does not change. Therefore, Z has 2(3m — k) + l alternations 
of a\, 02 if case c. does not occur, and has 2(3m — k) alternations if case c. 
does occur. The word has 2(3m — k) alternations of ai,ci2, and since 
is consumed by Z, Lemma [1] implies that Z cannot have more alternations 
of ai,d2 than F^. Therefore, it must be that case c. occurs and case b. does 
not. 
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We claim that case c. must occur as a full cancellation case. If not, 
then Z will consist of a subword with 3m — k alternations of 01,02 that 
came from Pi, followed by some non-zero number of 6's from the Zj of 
case a, and then by another subword with 3m — k alternations of 01,02 that 
came from P3. In other words, (aia2) 3m_fc 6(ai02) 3m_fe is a subsequence 
of Z. It is not, however, a subsequence of F/~, contradicting the fact that 
Fk is consumed by Z. If follows that case c. must have occurred in the full 
cancellation version. Let J be the value of j for which case c. occurred; since 
it was a case of full cancellation, ij = n&. 

Therefore, Z has 2(3m— k) alternations of ai, 02, and is the concatenation 
of the 3m — k many Zj's that arose in case a. (the empty Zj has been 
dropped) and of zero or more non-matched af&^a^'s. The fact that and 
Z both have 2 (3m — k) alternations of ai, 02, means that the way Fk can be 
consumed by Z is tightly constrained. First, any non-matched block a\b B a^ 
in Z must consume (and fully match) an identical block in Fk leaving a 
resultant of e. Second, any Zj with i alternations of 01,02 will be of the 
form © and must consume a subword Gj = {a\b B a\Y of F^. The first 01 
of Zj must match one of the two first ai's of Gj] the final 02 of Zj must 
match one the final two 02 's of Gj; the other subwords a\ and a| of Zj must 
match identical subwords in Gj] and the IB — ij many 6's in Zj all must 
match 6's in Gj. This can always be done, no matter what the values of 
the m s 's in Zj are. Since Gj has £B many 6's, the consumption of Gj by Zj 
yields a resultant W'j equal to a^b^ai. 

It follows that, when Fk is consumed by Z, the resultant equals the con- 
catenation of the strings W'j = aib lj a,2, omitting the word wj (which trig- 
gered case a). In other words, the resultant is just Wk+i, proving Lemma[T2l 
in this case. 

We still have to consider the case where case d. or e. occurs. The cases 
are symmetric, so suppose case d. occurs, and thus the rest of the Zj's are 
generated by CclSG EL. Suppose Zj is obtained via case d., and so is equal 
to ([9]). We claim that this must be a full cancellation case of case d., with 
rik = ij- If not, then Z contains 3m — k alternations of 01,02 up through 
Zj, followed by the final ai of Zj and at least one 6 at the end of Zj, and 
then followed by 3m — k alternations of 01,02 in the remaining part of Z. 
In other words, (aia2) 3m_fc ai6(ai02) 3m_fc is a subsequence of Z. It is not a 
subsequence of Fk however, contradicting the fact that Fk is to be consumed 
by Z. Thus we must have a full cancellation case of case d. 

Now consider what immediately follows Zj in Z. It must either be of 
the form a\b B a\ (obtained from a non-matched block), or, referring to ([8]), 
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be the word of the form 

V 

Z j+1 = ai b B ~<\{(alalb B - m '°)a 2 . 

s=2 

obtained from for Yj + i. We claim it is impossible for Z^a\b B a\ to be 

a subword of Z. If so, {a\a 2 Y m ~ k a\(a\a 2 f' rn ~ k is a subsequence of Z, and 
thus Z is not a subsequence of F^. As before, this is a contradiction. 

We have eliminated the other possibilities, so ZjZj + \ is a subword of Z 
and rik = m^+i. Therefore, m s = for all s < £, and we have 

I' 

Z 3 Z j+l = a 1 b B (a 2 2 ajb B YYl{a 2 2 a 2 1 b B -<)a 2 . 

s=l 

Note that ZjZj + \ contains I + £' alternations of ai,a 2 . Also note that 
the subword YjYj + \ contains £ + £' + 1 many such alternations. There- 
fore Z has 3m — A; alternations of ai,a 2 , namely one fewer than (as 
desired). Similarly to the argument four paragraphs above, it follows that 
ZjZj + \ must consume a subword G of of the form {a\b B a 2 ) ■ Since 
m[ + • • • + m' e , = ij+i, ZjZj + i has (I + £')B — ij many 6's. Hence the resul- 
tant when G is consumed by Zj is equal to aib lj+1 a 2 . If follows again that 
when Ffc is consumed by Z it yields the resultant Wk+i as desired. 

This completes the proof of Lemma [T2J □ 

The V-Condition. The proof of Theorem [3] will be finalized once we 
prove that the V-Condition must hold: 

Theorem 14. Any accepting computation e\\ws satisfies the V-condition. 

Let 

l-i 

v = Hvii = n ( C1 *vc2) 2 , (io) 

i=0 j=t,-, 2,1 

i.e., V = V(Vi ■ ■ ■ v 2 v 2 v\V\. (The dependence of V on I is suppressed in the 
notation.) The symbols c\,x, y, c 2 occur only in the subwords ve of ws, and 
V is the subsequence of ws containing these symbols, but in reversed order. 
(We use the reversed order since it makes the proof below a little simpler 
to state.) Clearly, any expression of ws as a square shuffle induces a square 
shuffle for V. Therefore Theorem 1141 is a consequence of Theorem 1151 

Theorem 15. Let £ > 1. The only accepting computation e||V r h*e||e is the 
one that matches each in V with the other Vk in V . 
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As a side remark, it is interesting to note that Figure [T] illustrates that 
Theorem [15] would not hold if the Vj's were instead defined to equal c\x 3 C2 
with the y's omitted. Theorem 1151 follows from the next three lemmas. 

Definition 16. Each subword x 3 or y 3 shown in the definition of V in fllQf) 
is called an x-block or a y-block, respectively. We also refer to them as 
full x-blocks or full y-blocks after they have been pushed onto the queue to 
emphasize that the complete subword x 3 or y 3 has been pushed onto the 
queue without any x or y from the block being matched. 

Lemma 17. If C is an accepting computation ofV , then C does not match 
any x (resp., y) with another symbol from the same x-block (resp. y-block). 

Proof. V contains an even number of ci's and an even number of C2 ! s. Con- 
sider some x- or y-block f3 in C. There is either an odd number of ci's before 
(and therefore, after) /3 in V, or an odd number of C2's before (and after) 
{3 in V. If there are, say, odd numbers of c\s then some c\ before f3 must 
match some c\ after f3 during C. The non-nesting condition now implies 
that no two symbols in f3 can be matched. □ 

Lemma 18. Suppose C is an accepting computation for V , and C does not 
completely match the first subword vi ofV with the second vi ofV (i.e., at 
least one symbol from the second vi of V is pushed onto the queue). Then 
there is a point in C where the queue contains either two full x-blocks or two 
full y-blocks. 

Proof. The proof splits into cases depending on how C starts off. For the 
first case, suppose the first c\ of V does not match the second c\ of V. 
By the non-nesting condition, this implies that the subword x^y^cix^ is 
pushed onto the queue. This puts two full x-blocks and two full y-blocks on 
the queue, so the lemma holds in this case. So, henceforth assume that the 
first c\ matches the second c\. 

Now suppose the first x-block x e does not completely match the sec- 
ond x l '. Therefore, some of the x's in the first x f ' match symbols from some 
x 3 with j < I. This x-block x 3 comes after the first two y-blocks (which 
equal y e ), so by the non-nesting condition, these two y-blocks are on the 
queue by the time the algorithms consumes the x-block x 3 . So the lemma 
holds in this case as well. Assume henceforth that the first cix^ is completely 
matched with the second c\x e by C. 

Finally, suppose that the first subword y l C2 does not completely match 
the second y^C2- In this case, we claim that, after consuming the second C2, 
C's queue will contain y m C2y m C2- To see this note that either the two 
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y^'s completely match (so m = 0) and then the C2's are not matched by 
assumption, or the two y^'s do not completely match (so m > 0) and then 
the C2's must be pushed to the queue since they cannot be matched while 
a y is at the top of the queue. At any rate, the queue contains two C2's 
once the second C2 is consumed. By the non-nesting property, the second C2 
on the queue must match the fourth C2 of V or a later C2 of V. Therefore, 
the two x-blocks x^ _1 that come prior to the fourth C2 are pushed onto the 
queue, and the lemma holds again in this case. □ 

Lemma 19. If C is an accepting computation for V and at some point 
in C the queue contains two full x-blocks (respectively, contains two full 
y-blocks), then there is a later point at which the queue contains two full 
y-blocks (respectively, contains two full x-blocks). 

Proof. Suppose C has two full x-blocks x m and then x J in the queue. Note 
j < m. Let the computation continue until x m has been matched, and then 
until x 3 has been matched. The symbols of x m are matched by symbols from 
x-blocks x s that have s < m (since the block x 3 was intervening). Therefore, 
x m 's symbols must match x's from at least two distinct x-blocks. Between 
these two x-blocks there is a y-block, and by the non-nesting condition this 
y-block is pushed onto the queue in its entirety. Similarly the x-block x 3 
is matched against symbols from at least two distinct x-blocks, and again 
there is a y-block between those two x-blocks that is entirely pushed onto 
the queue. Therefore, once the x 3 is matched, there are at least two full 
y-blocks in the queue. 

The dual argument works with x and y interchanged. □ 

We can now prove Theorem 1151 

Proof. The proof is by induction on I. The base case I = 1 is trivial. Suppose 
£ > 1. If an accepting computation C matches the first two subwords 
c\x e y £ C2 against each other completely, then the rest of the computation C 
is an accepting computation on the rest of V, namely V minus these first two 
subwords. By the induction hypothesis, the latter accepting computation 
matches each pair of subwords c\x 3 y 3 C2, and the theorem holds. Otherwise, 
if the first two subwords c\x IL y e 'C2 of V are not completely matched by C, 
then Lemma [T8l states that C contains some point where its queue contains 
either two full x-blocks or two full y-blocks. Lemma [19] then implies that 
C's queue must contain two full x- or y-blocks infinitely often, which is a 
contradiction. □ 



18 



That completes the proof of Theorem [T5| and thereby the proof of The- 
orem PHI giving us the V-Condition that was needed for the proof of Theo- 
rem [3l 
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